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Abstract 

We study certain consistent families (F\)\>o of Galton- Watson forests with lifetimes as 
edge lengths and/or immigrants as progenitors of the trees in F\. Specifically, consistency 
here refers to the property that for each u < A, the forest has the same distribution as 
the subforest of F\ spanned by the black leaves in a Bernoulli leaf colouring, where each leaf 
of F\ is coloured in black independently with probability /x/A. The case of exponentially 
distributed lifetimes and no immigration was studied by Duquesne and Winkel and related 
to the genealogy of Markovian continuous-state branching processes. We characterise here 
such families in the framework of arbitrary lifetime distributions and immigration according 
to a renewal process, related to Sagitov's (non-Markovian) generalisation of continuous-state 
branching renewal processes, and similar processes with immigration. 

AMS 2000 subject classifications: 60J80. 
Keywords: Galton- Watson process, continuous- state branching process, random tree, immi- 
gration, age-dependent branching, geometric infinite divisibility, backbone decomposition 

1 Introduction 

Galton- Watson branching processes are a classical model for the evolution of population sizes, 
see e.g. [HE2]. More specifically, there is an interest in the underlying genealogical trees. In the 
most basic model, there is a single progenitor that produces i children with probability q(i) for 
some offspring distribution q on No = {0, 1,2,.. .}; recursively, each individual in the population 
produces children independently and according to the same distribution q. We represent this 
model by a graph-theoretic tree rooted at the progenitor, where each individual is a vertex and 
the parent-child relation specifies edges v — > w between parent v and child w. Vertices related 
to just their parent vertex and to no child vertices are called leaves. More precisely, we will 
follow Neveu [36] to distinguish individuals (see Section 12. ip . We will consider in this paper the 
following well-known and/or natural variants of Galton- Watson trees (see e.g. Jagers [24J): 

• GW(g)-trees as the most basic model just described; 

• GW(q, re)-trees as GW(g , )-trees, where each individual is marked by an independent lifetime 
with distribution k on (0, oo); this includes the case k = Exp(c) of the exponential lifetime 
distribution with rate parameter c G (0, oo); 

• GW(g, k, /3)-bushes as bushes (finite sequences) of a random number N of GW(g, «;)-trees, 
where N is Poisson distributed with parameter (3 £ [0, oo), in shorthand: N ~ Poi(/3); 

• GWI(g, k, rj, x)-forests as forests (point processes on the forest floor [0, oo)) of independent 
bushes of iVj GW(g, n)-tiees at the locations Si of a renewal process with inter-renewal 
distribution x on (0, oo), where each JVj, i > 1, has distribution 77 on N = {1, 2, . . .}. 
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With the common interpretation that individuals give birth only at the end of their life and 
that renewal locations are immigration times, all but the first model give rise to continuous- 
time processes counting the number Yt of individuals in the population at time t > 0. In these 
continuous-time models it is natural to take q(l) = 0, since an individual producing a single 
child at its death time can be viewed as continuing to live instead of being replaced by its child. 
Reduction by Bernoulli leaf colouring was studied in [11] and reads as follows in our setting: 

• independently mark each leaf of a tree T (with lifetimes), or of a bush B = (Tm , • • • , T<N)) 
or of a forest F = (B(t),t > 0) by a black colour mark with probability 1 — p G (0, 1), by 
a red colour mark otherwise; for this to be non-trivial, let q(0) > 0, as T then has leaves; 

• if there are any black leaves, consider, as illustrated in Figure HJ 

— the p-reduced subtree Tf^ r of T as the subtree of T spanned by the root and the 
black leaves (with lifetime marks inherited); 

— and the p-reduced tree T p ~ rdc derived from Tf7 r by identifying vertices via the 
equivalence relation generated by v = w for vertices in TF~7* Ac if v — > w and w is the 
only child of v in TF~7* Ac (marked by the sum of lifetimes in each equivalence class); 

— or the p-reduced bush B p ~ rdc = (Tf~s dc , . . . , Tf~ rdc ,) as the p-reduced trees associ- 
ated with the subsequence . . . , I NP -xdc) of trees in B that have black leaves; 

— or the p-reduced forest F p ~ rdc = (B p ~ rdc (t),t > 0) of p-reduced bushes. 




Figure 1: Black vertices are represented by solid circles and red ones by circle lines. 

It is easily seen that if T is a Galton- Watson tree, then given that there are any black leaves, 
the p-reduced subtree 7^ rdc and the p-reduced tree T p ~ rdc are also Galton- Watson trees [II] . By 
standard thinning properties of Poisson point processes, the p-reduced bush B p ~ rAc associated 
with a GW(g, k, /3)-bush B is also a Galton- Watson bush. We refer to the offspring distribution 
gP-rdc^ lifetime distribution /-c p ~ rdc and the Poisson parameter / gp _rdc Q f B p ~ rdc as the p- 
reduced triplet (q p ~ vdc , K p ~ rdc , /3 p_rdc ) associated with (q,K,/3), similarly for forests etc. It is not 
hard to find offspring distributions q that do not arise as p-reduced offspring distributions for 
any q (e.g. q(0) = q(3) = 1/2). More precisely, [IT] obtained the following characterisation. 

Theorem 1 (Theorem 4.2 of |11|). For an offspring distribution q, the following are equivalent: 

(i) There is a family (q\)\>o of offspring distributions with q\ = q such that q^ is the (1 — fi/X)- 
reduced offspring distribution associated with q\, for all < fi < A < oo. 

(ii) The generating function ip q of q satisfies 

oo 

Vq(s) = Y; Si( l( i ) = S + ^ 1 - S )> 0<S<1, (1) 

i=0 

where for some b G M, a > and a measure II on (0, oo) with f, Q ^(1 A x 2 )U(dx) < oo, 
^(r) =br + ar 2 + / (e~ rx - 1 + rxl {x<1} )Il(dx), r > 0. (2) 

J(0,oo) 
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In the setting of (i) and (ii), a consistent family (-Ba)a>o of GW(q\, Exp(ca), f3\)-bushes can 

be constructed such that (B^^Bx) = (B^ M ^ rdc ,B\) for all < fi < A < oo. For each 
c = c\ G (0,oo) and f3 = j3\ G (0, 00), the family {q\,c\, /3a)a>o unique, (q\)x>o does not 
depend on (c, /3), u>Me (ca)a>o depends on q but not on j3 and (/?a)a>o on q but not on c. 

In , this result is a key step in the construction of Levy trees as genealogies of Markovian 
continuous-state branching processes with branching mechanism tjj, where ijj is a linear transfor- 
mation of ip that we recall in Section 12.3,11 In the present paper we establish characterisations 
analogous to Theorem Q] for the other variants of Galton- Watson trees, bushes and forests. 

Theorem 2. For a pair (q, k) of offspring and lifetime distributions, the following are equivalent: 

(i) There are families (q\, k\)\>q with qi = q and K\ = n such that (q^, k^) is the (1 — /x/A)- 
reduced pair associated with (q\, k\), for all < fi < A < 00. 

(ii) The generating function ip q of q satisfies tp q (s) = s + ^{1 — s), where ip is of the form ([2]). 

Moreover, k is geometrically divisible in that there is a family (Xa ,j > 1) of independent 
identically distributed random variables and ~ Geo(a) independent geometric with 

parameter a, i.e. F(G^ = k) = a{\ - a) k ~ l , k G N, such that H h X^"^ ~ k 

• for all a > l/ip'(oo) ifip'(oo) < 00, where i(j'(oo) means \im r ^. OQ , ip'(r); 

• for all a > if ip'(oo) = 00. 

In the setting of (i) and (ii), a consistent family (-Ba)a>o °f GW(q\, K\, (3\)-bushes can be 

constructed such that {B^,B\) = (B^ rdc ,B\) for all < fi < A < 00. For each 

(3 = (3\ G (0,oo), the family (q\, K\, /3a)a>o ^ s unique, (q\)\>o does not depend on (k, f3) while 
(^a)a>o depends on q but not on j3 and (/?a)a>o on Q but not on k. 

The requirement on k set in the second bullet point is referred to as geometric infinite 
divisibility in the literature, see [30] . also Section [2.21 here. Since the distribution k = Exp(c) is 
geometrically infinitely divisible, Theorem [2] is an extension of Theorem [TJ 

Theorem 3. For a pair ((/, n) of offspring and immigration distributions, the following are 
equivalent: 

(i) There are families (qx,r]x)x>o with q± = q and r\\ = r\ such that (q^,^^) is the (1 — /i/A)- 
reduced pair associated with (q\,n\), for all < [i < A < 00. 

(ii) The generating function ip q of q satisfies (f q (s) = s + vp(l — s), where ip is of the form (J2J) . 
Moreover, the generating function ip v of rj satisfies 

00 

<Pr,(s) = J2s i rj(i) = 1-0(1- s), 0<s<l, 

i=l 

where for some d G M, and a measure A on (0, 00) with Jj Q ^(1 A x)A{dx) < 00, 

4>(r) = dr+ [ (1 - e~ rx )A(dx), r > 0. (3) 

In the setting o/(i) and (ii), a consistent family (F\)\>o ofGWl(q\, Exp(ca), Ex.p(h\)) -forests 

can be constructed such that (F^,F\) = (F^ rdc ,F\) for all < /i < A < 00. For each 
c = c\ G (0,oo) and h = hi G (0, 00), the family (q\,c\,r]\,h\)\>o is unique, (q\)\>o does not 
depend on (c,n,h), while (ca)a>o depends on q but not on (n,h), (t/a)a>o depends on q but not 
on (c,h) and (h\)x>o depends on (g, 77) but not on c. 
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The binary special case with single immigrants, where for some 9 > and all A > 

iia(i) = i, h A = V» 2 + 2\-e, 

leads to the setting of [38], where (-F\)a>o was shown to have independent "increments" expressed 
by a composition rule, and to converge to the forest in Brownian motion with drift —9. 

Theorems [2] and [3] describe in the same way the genealogy of associated continuous-state 
branching processes (CSBP) as Theorem [1] Specifically, for Theorem [2] the continuous-state 
processes are Sagitov's age-dependent CSBP(K,tft) based on a branching mechanism ip and 
the distribution of a local time process K, i.e. either an inverse subordinator or an inverse 
increasing random walk, see \27\ 140] and Section T3.3.2I here: for Theorem O they are CSBP with 
immigration, CBI(?/>, (ft), where (ft is an immigration mechanism, see |29U31j and Section [2.2l here. 

Proposition 4. Let (Z x ,t > 0) be the population size process in the setting of Theorem^ If 
ip'(0) > — oo, then 



z t x 



^-i(A) 



almost surely as A — > oo, for all t > 0, 



where (Zt,t > 0) is a CSBP(ET, ift) with Zq = f3, for some ift linear transformation of ip and 
K = (K s , s > 0) such that inf{s > 0: K s > V\} ~ k\ for V\ ~ Exp(ca) with c\ as in Theorem 

m 

Proposition 5. Let (Y t x ,t > 0) be the population size process in the setting of Theorem^ Then 
Y x 

1 Yt in distribution as A — )• oo, for all t > 0, 



^-i(A) 

where (Yt,t > 0) is a CT$I(ip, (ft) with Yo = 0, for ift and (ft linear transformations of tp and (ft. If 
furthermore ip'(0) > — oo and 4>'(0) < oo, then the convergence holds in the almost sure sense. 

These convergence results should be seen in the context of the large literature on space-time 
scaling limits of branching processes in discrete or continuous time, see [TDl [121 EH [2IJJ EH 
[371 140] . Convergence in distribution holds under much weaker assumptions on the families 
(q\> k A) A\)a>o or (Qx,c\,r]x,hx)\>o and invariance principles in varying degrees of generality 
have been obtained. It is also well-known that the convergence in distribution at a fixed time 
for Markovian branching processes implies the convergence in distribution of the whole process 
in the Skorohod sense of convergence of right-continuous functions with left limits. In |12j . 
joint convergence of processes and their genealogical trees is shown, also for a wider class of 
families (-Ba)a>0 suitably converging to bushes of Levy trees. The main contribution of the 
present work is to provide almost sure approximations of more general classes of continuous- 
state processes and consistent families of trees that contain full information about the genealogy 
of the population of the limiting continuous-state process, which is not contained in the limiting 
process itself nor in the approximating discrete-state branching processes. 

The structure of this paper is as follows. In Section [2] we formally set up the framework in 
which we represent trees, we recall preliminaries from Duquesne and Winkel [H] and develop 
a bit further some aspects that readily transfer and serve in the more general context here. 
We also provide some background about continuous-state branching processes with immigra- 
tion, and about geometric infinite divisibility. Section [3] presents the theory around Theorem 
[2] and Proposition [H while Section [4] deals with Theorem [3] and Proposition O In each setting 
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we provide explicit formulas for offspring distributions, lifetime distributions and immigration 
distributions as appropriate; we also provide explicit reconstruction procedures that reverse the 
reduction for the consistent families of bushes and forests and establish connections with back- 
bone decompositions (Theorem I22p and Levy trees. We finally deduce generalisations combining 
lifetimes and immigration. 

2 Preliminaries 

2.1 Discrete trees with edge lengths and colour marks 
2.1.1 Discrete trees and the Galton- Watson branching property 

Following Neveu [36], Chauvin [8] and others, we let 

U = |J N n = {0, 1, 2, . . . , 11, 12, . . . , 21, 22, ... , 111, 112, . . .} 

n>0 

be the set of integer words, where N = {1, 2, 3, . . .} and where N° = {0} has the empty word as 
its only element. For u = u\U2 ■ ■ ■ u n E U, and v = V1V2 ■ ■ ■ v m E U, we denote by \u\ the length 
of u, e.g. \u\U2 ■ ■ ■ u n \ = n, and by uv = U1U2 ■ ■ ■ u n v\V2 ■ ■ ■ v m the concatenation of words in U. 

Definition 6. A subset t C U is called a tree if 

• E t; we refer to as the progenitor of t; 

• for all u E U and j E N with uj E t, we have u E t; we refer to u as the parent of uj; 

• for all u E t, there exists u u (t) E No = {0, 1, 2, . . .} such that uj E t 44> 1 < j < f u (t); we 
refer to u u {t) as the number of children of u. 

We refer to the length \u\ of a word u E t as the generation of the individual u in the 
genealogical tree t. An element u E t is called a leaf of t if and only if v u (t) = 0. We consider 
the lexicographical order < on U and its restriction to t as the canonical total order. For u, v E U, 
we write u ^ uv, defining a partial order on U, whose restriction to t is the genealogical order 
on t. The partial order < is compatible with the total order < in that u ^ v u < q. A tree t 
in the sense of Definition [6] can be represented graphically as in Figure [2j 

Let T be the space of all such trees, and let T u = {t E T : u E t}. Then v u is a map defined 
on T u taking values in Nq. Note that T is uncountable. A sigma-algebra on T is defined as 



111 112 131 132 



11 



12 



13 



21 



221 222 



22 




(22,C22) 



Figure 2: On the left, t = {0, 1, 2, 11, 12, 13, 21, 22, 111, 112, 131, 132, 221, 222}, and on the right 
t = {(0, C ), (1, Ci), (11, C11), (12, C12), (13, C13), (2, C2), (21, C21), (211, Can), (212, C212), (22, C22)}. 
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T = a{T u ,u G U}. We also specify the nth generation 7r n (t) = {«6t: \u\ = n} = t n N n and 
set T n = cr{T u , \ u\ <n} = o-{ir m ,m < n}. 

We define the shift map/operator 9 U that assigns to a tree t its subtree t u = 9 u t above u G t: 

9 U :T U ^T, t i—T- u t = t u = {v <E V : uv £ t}. 

Clearly Tg = T and {u u > j} fl T u = T UJ -, also 0~ 1 (T„) = T nt) and 

T„ = {t G T: z/ ui t, 2 ... 1 ; fc (t) > Wfc+i for all < k < m} for v = v\V2 ■ ■ ■ v m G U. 

These relations allow us to consider a random tree r whose distribution is a probability measure 
P g on T, under which the numbers of children v u of the individuals u in the random tree are 
independent random variables with distribution q. More formally, P q is characterised as follows: 

GW(g)-trees and their branching property (see e.g. Neveu [36J ) 

(a) For any probability measure q on No, there exists a unique probability measure P g on 
(T, _F) such that P g (f0 = j) = q(j), and conditionally on {vq = j} for any j > 1 with 
lU) > 0' the subtrees 6>j, 1 < i < j, above the first generation are independent with 
distribution ¥ q . A random tree r with distribution ¥ q is called a GW(g)-tree. 

(b) Under P g ( • \T n , ir n = A), the subtrees 9 U , u £ A, above the nth generation are independent 
and with distribution F q , for all finite 4cN™ and n > 1 with P(7r n = A) > 0. 

For finite trees, in particular in the (sub)critical case K q (u$) = ^jeN J?0') — 1j measure P g 
can be expressed as 

P«({t}) =II^( t ))' foralltGT, 
net 

but this does not specify the measure F q in the supercritical case E g (f0) > 1, where P g assigns 
positive measure to infinite trees. Here, E g is the expectation operator associated with P„. For 
a GW(g)-tree r, the process G n = #7r n (r), n > 0, is known as a GW(g)-branching process. 

2.1.2 Marked trees and discrete branching processes in continuous time 

Let (H, 7^) be a measurable space of marks. We can attach a mark £ H to each vertex u of 
a given tree t. Formally, a marked tree is a subset 

tcllxl such that t = {u G U: (u,£ u ) G t for some £ u G H} € T (4) 

and where t n {u} x H = {(tt,£u)}> i.e. the map £: t — > M is unique. So a marked tree has the 
form t = {(it, £«) 6 U x H: u G t}. We denote by T H the set of marked trees. We consider the 
set of trees TjJ = {t 6 T H :u£t} containing individual u and note that : — > M is a map. 

For marked trees, we set v u (t) = v u {t) and t u = 9 u t = {(v, £ v ) : v G t u } = {(i>, : -ui> G t}. 
As sigma-algebra on T H we take one that makes t i— > t in (j4|) and t i-> £ u (t) measurable: 

^ e = a{T= H , u G U, /J G W}, where TjJ H = {t G : £ u g H}. 

For example, for EI = (0, oo), the marks can represent the lifetimes of individuals. We will 
later use H = (0, oo) x {0, 1} so that £ u = (Cu>7u) consists of a lifetime mark £ n G (0, oo) and a 
colour mark 7 n G {0, 1}. We consider a model when u G t will produce children at the moment 
of its death. The birth and death times a u and to u of each individual u G t, are then defined 
recursively by 

f «0 = 0, w = C0, 

\ a uj = uj u , uj uj = a uj + C U j, 1 < j < v u , u G t. 
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We denote by 7it(t) = {«Gt: a u < t < lo u } the set of individuals alive at time t > and define 
-^iP = cr{7fs) s < t}. For u G 7ft (t), we denote by 

0«,i(t) = {(0,w n -t)}U{(u,C™): w G t} 

the subtree of individual u above t. Figure [2] gives a graphical representation of a tree t where 
lifetimes are shown as edge lengths. 

In this formalism, we can define and study GW(q, «)-trees as GW(q) trees with independent 
lifetimes distributed according to a measure k on (0,oo): 

GW(g, «)-trees and their branching property (see Neveu |36j, Chauvin [8j) 

(a) For any probability measure Q on No x H, there exists a unique probability measure 
Pq on (T 11 , .F 11 ), such that (^0,^0) ~ Q and conditionally on \y§ = j, £ G H} for any 
j > 1, H G %, with Q({j} x H) > 0, the subtrees 0j, 1 < i < j, are independent with 
distribution Pq. For H = (0, 00) and Q = q ® k a random tree T with distribution Pq is 
called a GW(q, At)-tree. 

(b) Under P 9 gj /t (- IJ 7 ] 3 ,^ = A), the subtrees n G A, above time t are independent and 
distributed like 9$^ under Pq® K ( • IC0 > s), where s = t — a u is the (J^-measurable) age 
of u at time t, for all finite AcV with P(7f t = A) > 0. 

For further details including strong branching properties, we refer to [6JE]. For a GW(g, «)-tree 
T, the process Z t = #T t (T), t > 0, is known as a Bellman-Harris branching process [21 [3]. The 
Markovian special case for k = Exp(c) is also called a continuous-time Galton- Watson process. 



2.1.3 Coloured leaves, coloured trees and a two-colours branching property 

On a suitable probability space (n, A, P) , let T: (Q, A, P) -> (T^ ' 00 ) , j^ ' 00 ) , F q ® K ) be a GW(?, «)- 
tree. We assume q(0) > 0, i.e. T has leaves, and q(l) = 0, as an individual producing a single 
child can be viewed as continuing to live instead of being replaced by its child. 

Following we independently mark the leaves u of T with one of two colours, say red, 
P( 7u (T) = 1K(T) = 0) =p, or Macfc, P(7„(T) = 0|i/ u (T) = 0) = for some given p G (0,1). 

It will be convenient to also mark each non-leaf individual in black if the subtree above it 
has at least one black leaf, red otherwise. Such a marked tree r p_co1 is a random element of 
T H for H = (0,oo) x {0,1}. We denote its distribution by P^™ 1 . Note that it is not of the 
form Pq introduced in the previous section, because marks for non-leaf individuals will not be 
independent. We set 

g (p) = P^ ol ( 70 = 1) = P(TP" co1 has only red colour marks) = E[p #{ueT: "«( T )=°}]. (5) 

Branching properties of coloured GW(g, K)-trees (cf. Duquesne and Winkel 

(a) For all Borel-measurable k: (0, 00) — > [0,oo), j > 2, £j G {0,1} and ^^'-measurable 
fi : T m — > [0, 00), i = 1, . . . , j, we have 

j 

HC<t))Ylfi{0i);v<li = J';(7i,---,7i) = (ei,...,£i) 
i=l 

f k(z) K (dz) q(j) g(p)ir(l - g(p))^ nE^ ol [/ 4 | 70 = e t ] 
(0,00) f = \ 

where j r = e\ + ■ ■ ■ + Sj and j& = j — j r are the numbers of red and black colour marks. 
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(b) For all i > and ^-measurable f u : T H [0, oo) 



-col 



IT/- 



J* 



n e^ oi [/ u (^ iS )ic > s ] 



s=t—a u 



In the exponential case k = Exp(c), this simplifies to 



p— col 
g<8Exp(c) 



n uv«,t) 



•> t 



II E g®Exp(c) 



(6) 



ueirt 



Reduction procedure to identify the "black tree" in a two-colours tree 

• We can extract Tf u ~ rdc = {(u,(u) G U x (0,oo): («,C«,1) G T^ co1 }, the individuals of 
j^p-coi w ^h pjacjj colour marks. If r£7 r 7^ 0, we rename the individuals of ?^T rdc by 
the unique injection 

" Cb^ = e U: (u, Cu) G T^- dc } U, 

that is increasing for the lexicographical total order on U, maps onto an element T^ b rdc 
of T and is compatible with the genealogical partial orders. We refer to the image tree 
T^ dc = {(l(u)Xu)- u G T^ d °} as the p-reduced subtree ofT. 

• As a further reduction, we remove single-child individuals and add their lifetimes to the 
child's lifetime. Formally, we define r p_rdc = {i)£ ^ub^- ^(^b^) ^ 1}> anc ^ 

n 

Cu=J2 Cu 1 -u l {r^ dc ), where J u = sup{ j : u Ul ... Ut (T^~ h ldc ) = l for alH G {j, . . . , n - 1}}, 

i=Ju 

for all u = u\---u n G r p_rdc . Again, there is a unique injection J: r p_rdc — >■ U that 
is increasing for the lexicographical total order on U, maps onto an element r p ~ rdc of 
T and is compatible with the genealogical partial orders. We refer to the image tree 
rpp-idc _ {(l'(u),£ u ) : u G r p ~ rdc } as the p-reduced tree (or as the black tree). 

Figure Q] in the Introduction illustrates the reduction procedure. 

Remark 7. (a) The reduction procedure is transitive in that for independent colouring, we 
have (T(i-Pi)-rdc)(l-p 2 )-rdc © T^~P^" rdc . In particular, colouring for T^-P^~ vdc and 

T (l-p 3 )-rdc for < ^ can be coupled such that T (l-p 3 )-rdc = ( r (l^)-rdc)(HPs/£i)-«fc 

(b) Although we have used notation for a random GW(q, /c)-tree T p ~ co1 with leaves coloured 
independently, note that the reduction of T p ~~ co1 to a black tree is a purely deterministic 
procedure. Our focus here has been on the technical framework and how it is used to for- 
mulate relevant examples. We postpone to Section [2]3] the review of further developments, 
notably of the reconstruction/growth procedures that reverse the reduction. 



2.1.4 Bushes and forests — models with several progenitors and immigration 

Branching processes with immigration have been studied widely (see e.g. Athreya and Ney pQ 
and Jagers [23] )• We consider the model, where immigrants arrive at the times Si, i > 1, of a 
renewal process Jt = #{i > 1: Si < t}, i.e. where Sq = and the interarrival times Si — Si—i, 
i > 1, are independent and identically distributed random variables with a common distribution 
on (0, 00) that we denote by x- At each immigration time Si the number N{ of immigrants 
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is independent and has a common distribution rj on N. Each immigrant produces offspring 
independently according to the rules of GW(g, «;)-trees. 

Denoting by Z^_ s , the size at time t of the population of immigrants arriving at time Si, 

i=l 

is the total population size at time t > 0. Here, are independent sums of iVj independent 
Bellman-Harris processes with offspring and lifetime distributions q and k as in Section [2.1.21 

To capture the genealogical trees of the population, we use the notion of a bush as a random 
sequence B = (T^, . . . , T(j\n) of independent trees, and the notion of a forest as a point process 
F = (B(t),t > 0) of independent bushes 

B(Si) = B {i) , i > 0, B(t) = d, t<£ {S h i > 1} for a cemetery point d. 

GW(g, n, /3)-bushes and GWI(g, k, rj, x)-forests 

• A GW(<7, k, /3)-bush is a bush B = (Tn\, . . . , Trm) °f independent GW(g, K)-trees T(j\, 
where ~ Poi(/3). 

• A GWI(g, K, T), x)-forest is a forest F = (B(t),t > 0) where each bush B(Si) = B® 
is associated with immigration at the times Si of a renewal process with inter-arrival 
distribution x an d consists of an independent ^-distributed number N{ of trees T^j) . 

It is straightforward to transfer the notions of colouring and reduction to the setting of 
bushes and forests, since they apply tree by tree. We will slightly abuse notation and write 
u £ B to refer to individuals in a bush, instead of writing formally u = (i,u") with u" E Tuy 
Similarly, ueF means u = (t,u r ) with u' £ B(t) in the sense just defined. We will also abuse 
notation v u , Q u and j u accordingly for u G B and u € F. 

2.2 Continuous-state branching processes and immigration 

We have looked at branching processes with immigration in the discrete state space No = 
{0,1,2,...} in continuous time. In this section we will recall (Markovian) continuous-state 
branching processes with immigration, where the state space (population size) will no longer be 
No but [0, oo), and time is also continuous. 

2.2.1 Subordinators and geometric infinite divisibility 

Definition 8. An increasing right-continuous stochastic process a = (a(t),t > 0) in [0, oo) is 
called a subordinator if it has stationary independent increments, i.e. if for every u, t > 0, the 

increment a(t + u) — a(t) is independent of (u(s), s < t) and a(t + u) — o~(t) = a(u). 

The distribution of a subordinator a on the space B([0, oo), [0, oo)) of functions /: [0, oo) — >■ 
[0, oo) that are right-continuous and have left limits equipped with the Borel sigma-algebra 
generated by Skorohod's topology, see e.g. [23], is specified by the Laplace transforms of its 
one-dimensional distributions. For every t > and r > 0, 

E(exp{-ra(t)}) = exp{-t0(r)} (8) 
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where the function (ft: [0,oo) —> [0,oo) is called the Laplace exponent of a. There exist a unique 
real number d > and a unique measure A on (0, oo) with J(l A x)A(dx) < oo, such that for 
every r > 

cj)(r) = dr+ I (l-e- rx )A(dx). (9) 

J(0,oo) 

Conversely, any function <j> of the form ([9]) is the Laplace exponent of a subordinator, which can 
be constructed as (dt + ^2 s<t Acj s , t > 0) for a Poisson point process (Acr s , s > 0) in (0, oo) with 
intensity measure A. Equation is referred to as the Levy-Khintchine representation of (j>. We 
refer to Bertoin [5] for an introduction to subordinators and their applications. 

Definition 9. A random variable X is geometrically infinitely divisible (g.i.d.) if for all a G (0, 1) 

x^E^, ;>i, (io) 

3=1 

for a sequence Xq , j > 1, of independent identically distributed (i.i.d.) random variables and 
an independent ~ Geo(a): 

P(G (Q) = fe) = a(l -a) fc ~\ fc>l. 

For example, an exponential random variable X ~ Exp(c) is g.i.d. since (|10[) holds for 
Jo' ~ Exp(c/a). The class of g.i.d. distributions can be characterised as follows. 

Lemma 10 ([30]). A random variable X is g.i.d. if and only if it can be expressed as X = o~(V), 
where a = (o~(t),t > 0) is a subordinator and V ~ Exp(c) independent of a for one equivalently 
all c G (0, oo). 

Indeed we can then express (V and hence) X as 
V = and X = <j v = Y, ( CT + • • • + Vl 3) ) ~ ° (V Q (1) + • • • + V a~ l) 

3=1 3=1 

Lemma 11. If X is such that (|10p holds for some a G (0, 1), the distribution of Xa is unique. 
Proof. E (e -^) = E < e ' rtf '» a „, * E(.-iV-*^L. □ 
2.2.2 Continuous-state branching processes 

Continuous-state branching processes were introduced by Jirina [26] and Lamperti [33] . They are 
the limiting processes of sequences of rescaled Galton- Watson processes as the initial population 
size tends to infinity and the mean lifetime tends to zero. In this section we follow Le Gall 



Definition 12. A continuous-state branching process (CSBP) is a right-continuous Markov 
process (Zt,t > 0) in [0, oo), whose transition kernels Pt(x,dz) are such that for every t > 0, 
x > and x' > we have 

P t (x + x',-) = P t (x,-)*P t (x',-), 
where * denotes convolution. In other words, if for a given transition kernel we denote, for 
each x > 0, by Z x a CSBP starting from Zq = x, then for Z x = Z x independent, we require 

Zf + Zf' ( = 5 z x+x '. 
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The transition kernel is specified by the Laplace transforms 



E(exp{-rZ t }\Z = x) = exp{-zu 4 (r)} (11) 

where ut- [0, oo) — > [0, oo). In fact, (t,r) \-t ut(r) is necessarily the unique non-negative solution 
of 

du t (r) 



u t {r)+ I %(j(u s (r))ds = r or — = -ip(u t (r)) (12) 



o 



with Uo(r) = r, for some ip: [0, oo) — > R of the form 

^(r) = br + ar 2 +/ (e _r:r - 1 + rxl {a , <1} )II(dx), (13) 

where b 6 M, a > and II is a measure on (0,oo) with |(1 A x 2 )H(dx) < oo and where Tp 
satisfies the non-explosion condition J Q+ |-0(r)| _1 dr = oo, see [20]. Equation (|13j) is referred to 
as the Levy-Khintchine representation of tp. The process (^t,t > 0) is then called a CSBP with 
branching mechanism ip, or a CSBP(^). We denote by P5, the distribution of (Zf,t > 0) on 
0( [0, oo), [0, oo)). There also exists a sigma-finite measure 0^, on B([0, oo), [0, oo)), such that 

{zf,t> o) { = } Yl E t(y)^> h * > o, 

\0<j/<a' / 

where (E(y),y > 0) is a Poisson point process in B([0,oo), [0,oo)) with intensity measure 0^,. 
2.2.3 Continuous-state branching processes with immigration 

Similarly, a discrete-state branching process with immigration has a continuous analogue, the 
continuous-state branching process with immigration, CBI for short. Following Kawazu and 
Watanabe |29j . see also |10t I31|. besides the branching mechanism ip for a CSBP, we also have 
an immigration mechanism (J) of the form ([9]) for the CBI, which we then refer to as CBI(ip , (f)) . 

A CBI(ip,4>) is a Markov process (Yt,t > 0) on [0, oo) whose transition kernels are charac- 
terized by their Laplace transform, which in terms of (ft and itj(r) as in (112p satisfy 



E(exp{— rY" t }|Yo = x) = exp /— xu t (r) — J 



4>(u s (r))ds 



In fact, a subordinator a = (a(t),t > 0) with Laplace exponent 4> can be viewed as a pure immi- 
gration process CBI(</>, 0). Indeed, a general CBI(^,0) is such that by time t>0a population 
of total size a(t) has immigrated and evolved like a CSBP(^); specifically, consider a Poisson 
point process (E s , s > 0) in D([0, oo), [0, oo)) with intensity measure dO^ + Jj ^ P^A(cfe), then 
in analogy with ([7]) 

s<t 

is a CBI(^, (f)). This follows by the exponential formula and properties of Poisson point processes. 

Examples of continuous-state branching processes with immigration include (sub)critical 
CSBP conditioned on survival, see [32] and literature therein. 
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2.3 Growth of Galton- Watson bushes with exponential edge lengths 

In Theorem [H we consider families of GW(qai Exp(ca), /3A)-bushes B\, A > 0, with the consis- 
tency property that any two bushes, for parameters \i < A say, are related by p-reduction as 
formally defined in Section [2.1.31 f° r P = 1 — ///A. The choice of p is dictated (up to a positive 
power for the ratio) by the consistency requirement that the relation holds for all /i and all A 
(cf. Remark Efa)). 

The equivalence of (i) and (ii) in Theorem [1] is a statement purely about offspring distri- 
butions ((/A; A > 0). The reason for including the other two parameter sequences (c\,\ > 0) 
and (f3\, A > 0) in the remainder of the statement is simplicity. Specifically, if we consider trees 
without lifetimes and hence without embedding in time, the removal of single-child individuals 
will be more artificial as it reduces the height of the trees; if we look at trees instead of bushes, 
the reduced tree will only be GW if we condition on the existence of at least one black leaf, and 
this does not lead to consistent families of random trees on the same probability space. 

In |11] , the bushes of Theorem [T] are used to construct Levy trees as a strong representation 
of the genealogy of the limiting CSBP(^) under some extra conditions on ip. In a weaker sense, 
the family (B\, A > 0) itself is already a representation of the genealogy of CSBP (-(/>), under no 
conditions on ip other than ip(oo) = oo to exclude the case of increasing CSBPs corresponding to 
"no death" i.e. "no leaves" . Roughly, B\ is the genealogy of a Poisson sample chosen among all 
individuals with intensity proportional to A; as A — > oo, the set of individuals included becomes 
dense. 



2.3.1 Consistent families of GW(q\, Exp(ca), /^-bushes 

In |11| . the parameters in a consistent family of Galton- Watson bushes are represented in terms 
of a branching mechanism ip that is just required to satisfy ip (oo) = oo, so that ip is eventually 
increasing and has a right inverse ip~ l : [0, oo) — >• [^ -1 (0),oo): 

^ (S) = S + |^AW^' ^'(^(A)), /?a = ^- 1 (A). (14) 

It follows from the derivation there that this ip and (ip, c) in the statement of Theorem Q] are 
related in a linear way as 

ip(r) = kiifj(k 2 r), (15) 

where h = 1/^(1) = ip~ l (l)ip' '{ip' 1 (1)) and k 2 = c0(l) = l/^ -1 (i). For the underlying CSBPs, 
the relationship (fl~5j) just means Z t = k2Zk 1 k 2 t, so ip and ip essentially refer to the same CSBP. 
With this parameterisation, ([5]) can be expressed more explicitly for q = q\ and p = 1 — fi/X as 

g x (l - /./A) = P 9A0Exp(CA) (7 = 1) = 1 - = 1 " • (16) 

2.3.2 Analysis of the reduction procedure for GW(q\, Exp(ca), /^-bushes 

In this section we will study some key points in the reduction procedure in the setting of Theorem 
[TJ These will be important for the proofs of Theorems [2] and El There are three steps in the 
reduction procedure from A to \x < A: colouring with p = 1 — fJ,/X, passage to the p-reduced 
sub-bush and passage to the p-reduced bush. 

In the last step, the lifetime (,u rd ° of individual u G T^ 1 " ^ rdc is obtained combining 

a number of lifetimes of T\ when removing the single-child individuals. By the two-colours 
branching property in Section 12.1.31 the random numbers Gu are Geo(a), where a is the 
probability that an individual in T^?^' rdc produce zero or more than two children. We can 
express a in terms of ip: 
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Lemma 13. Let < \i < A < oo. Given that u € rdc , we have G^ ~ Geo(a) with 



« = P(-0(^ A) " rdC )^l 



,(l- M /A)-col\ _ n \ _ H^)) 



7,(^—1=0)=^^. (17) 



Proof. According to the definition of a and the two-colours branching property of Section (2TL3] 



1-a = F(, (Tl^ rdc )=l 



70(T a-/VA)-coij =0 



= ^ a ( 3a (1-m/A)) 

and by (HI]) and CED we obtain 1 - a = 1 - D 

In the reconstruction procedure reversing the reduction procedure, we will therefore subdivide 
each lifetime in the GW(q^, Exp(c At ), /3 A1 )-bush into a geometric random number of G^ parts. 

Remark 14. By the branching property of coloured GW(q, «;)-trees in Section 12.1.31 lifetime 
marks are independent of colour marks. Therefore, Lemma [13] also holds for general B\ ~ 
GW(g^, K\, /3 / \)-bushes, not just for GW(^, Exp(c^), /3A)-bushes. 

In the case of Exp(c A1 ) lifetimes, given the lifetime Cu(Tu) the conditional distribution of the 
random variable N u = Gu — 1 follows a Poisson distribution: 

Proposition 15. Let Q be a random variable having distribution Q ~ Exp(ac). Suppose that £ is 
subdivided into G^ independent parts, ( = Ci + - ■ •+Cg( q )' where Q ~ Exp(c) andG^ ~ Geo(a) 
are independent. Then given ( = z for z > 0, we have 

noM=tK= , )= Ki-^y^ , (18) 

Proof. This is, of course, well-known in the context of Poisson processes, but let us give a 
direct argument and write the left hand side as a conditional expectation ¥(G^ = k \ Q = z) = 
E{l {G{a)=k} \( = z). We also set g k {z) = ((1 - a)cz) k ~ 1 e-^-°^ cz / '{k - 1)!. 

Claim: E(/(C)^(C)) = E(/(C)l {G W= fc }) f ° r a11 measurable / > 0. 

f°° f°° nic((\ — cy)cz) k ~ 1 e~ cz 
As C ~ Exp(ac), E(/(C) 5fe (C)) = J f(z)g k (z)ace- acz dz = J f{z) U >_ ± dz. 

On the other hand, since C = Ci + " " " + CgW > ^ ne conditional distribution of £ given 
is Gamma(c,fc). Therefore, 

E(/( C )l {GM=fe} ) = P(G< a > = fc) /(z) ^^ cfa = /(^) aC((1 " (fc a ^" le " CZ ^, 

and so ^(z) is a version of the conditional probability P(G( a ) = k \ £ = z), as claimed. □ 

Moreover, we can also find the conditional joint distribution of the lifetimes (Ci, • • • , Cg( q )-i) 
given ( = z and G^ = k as follows: 

(k - 1)' 

/ci,C 2 ,..,a-i|G(«)=fc,C=*(2/i> 2/2, • • • , ifc-i) = yfc _x ' (19) 
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for all 2/1 > 0, ... , yu-i > such that y% H + yfc-i < z. 

In the middle step of the reduction procedure, all red individuals are removed. |1 1 J noted 
that it is a consequence of the two-colours branching property, see Section [2.1.31 here, that they 
form independent GWfg^JJ^' co1 , Exp(cA))-trees, "red trees", where 

^(^(A-^Xl-s)) 

ip (i- M /A)-coi(a) = s+ g \T' with = + - ^ ( 20 ) 

Furthermore, the numbers of red trees removed at the branchpoints are conditionally indepen- 
dent, and given that a branchpoint has m > 1 subtrees containing black leaves, the generating 
function of the number of red trees can be expressed in terms of the mth derivative of tp^: 

tf^Vv^A-^U-s)) f ^ o ^~\\-n){l-s))-^-\\-v)) 

- — -. — : for m > 2 or — — - , — - tor m = 1. 

^ m) (0) W 1 ^-!')-^) 

(21) 

2.3.3 Reconstruction procedure for GW(gAj Exp(ca), /?A)-bushes 

Let 5 M = {T^\rf\. . .,T^ N,i) ) be a GW(? M , Exp(c M ), /3 At )-bush. We will construct B\. 

(i) (i) 

1. For each individual u in , given the lifetime Q =-25 subdivide into a random number 
Gu*'*^ of parts with distribution (1181) . where a is as in (fTTl) and the parts (Cjfl, . . . , C W („,«) 
have joint distribution f)19|) . Now for each ? G {1, . . . , A^}, there is a unique injection 

(4)- 1 :r( i ) = {«eU:(«,C«)eT®}^U 

such that = lfc wi t h fc = G^' i] - 1 and (4) _1 (uj) = ^)~ 1 (u)jl k with fc = 

— 1, for all iy G where l fc is a string of letters 1. We define 

2. For each individual u in , given Vu = m > 1 children, consider a random number G« 
of further children with distribution f)21 1) and a uniform random permutation among 
the (m + C$)\/m\ permutations with ^(1) < • • • < g$(m). Let T^' 1 '*), . . . ? y^M with 

= G-u be independent GW(g[ 1 re ^ A ' rdc , Exp(cA))-trees with offspring distribution (|20j) . 
Then for each i G {1, . . . , A^}, there is a unique injection 

( ti )-i : f« = {„ G U:(«,^))Gf»}^U 
such that {a)~ l {%) = and = for all uj G r^. We define 

3. Given = n, consider a random number 7VT ~ Poi(/?A — /9u) of further progenitors and 
a uniform random permutation g among the (n+A^ ed )!/n! permutations with g(l) < • • • < 

gin). Let f{ n+1) , . . . , f ( n+fc ) with k = N{ cd be independent GW( ? J re f )_rdc , Exp(c A ))- 
trees with offspring distribution (|20p . Then we finally define 



B A = (Tjf . . . , with n = A 7 ^ and k = N{ cA . 
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Remark 16. (a) The constructed bush is indeed a GW(q\, Exp(ca), /3^)-bush and the pair 
obtained (-£?«, B\) has the same distribution as (B^ rdc , B\). The intermediate trees 
have the same distribution as the (1 — /i/A)-reduced subtrees, also jointly with the 
pair. If the split into two parts in the definition of is used to assign black colour marks 
to the first part and red colour marks to the second part, then the resulting trees have the 
same distribution as the (1 — ^/A)-coloured trees B^ co1 . We refer to Figure [T] as a 
graphical illustration of the reduction and hence the reconstruction procedure. 

(b) In [11], this reconstruction procedure is also formulated for representations of the trees in 
a space of M-trees, tree-like metric spaces that we briefly address in Section [3.31 

(c) It is a simple consequence of the reduction procedure and/or the reconstruction procedure 
that (B\, A > 0) is an inhomogeneous Markov process in a suitable space of finite sequences 
of (0, oo)-marked trees. Similarly, Q can be strengthened in the present context to 



e( n fu(e u ,{B<f ,x) - C(A 



uew t (B x ) 

(l-/VA)-col 



where Q\± = cr {tt s (B\i), X' > A, s < t}, A > 0, t > 0, and B x is as in (a). 

2.3.4 Limiting behaviour as A — > oo 

In the context of limiting results as A — > oo, we record the following corollary of Lemma [13l 
Corollary 17. If we fix /x > in the setting of Lemma [T3l we have 

where if)'(oo) means lim^oo t//(A), in the following sense: 

• // ip'(oo) < oo, then a(/u, A) — > ao(fi) = ip' (tp -1 (fi)) / tp' (oo) . 

• Iftp'(oo) = oo, then a(fi,X) — > ao(fi) = 0. 

Note that this means that as A — > oo, lifetimes are cut into finite Geo(ao(/i))-distributed 
numbers of pieces in the first case, but into infinitely many pieces in the second case. 

Lemma 18. Let (Z^,t > 0), A > 0, be continuous-time Galton- Watson processes associated 
with a consistent family (Bx)\>o, of GW(q\, Exp(c^), (3\)-bushes as in Theorem^ Then 

z£ —7- Zt in distribution, as A — > oo, for all t > 0, 



^r-i(A) 

where (Zt,t > 0) is a CSBP(^) starting from Zq = f3, with if) as in (|14p . If furthermore 
ip'{0) > —oo, then the convergence holds in the almost sure sense. 

Proof. First consider t = 0. The initial population sizes are Poisson distributed with 

E(exp{-rZ A /^~ 1 (A)}) = exp{/3V~ 1 (A)( e - r/ ^" 1(A) - 1)} -> e~ Pr , 
as A — > oo, since ip'~ 1 (X) — > oo. 
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For t > 0, the desired limiting distribution is characterised by (jlip in terms of Ut(r) and for 
x = (5. If we integrate (j!2p . we can identify ut(r) as the unique solution of 



dv 

ut(r) 



(22) 



Consider Z^/ip' 1 ^) as a sum of a Poisson number of independent GW(q\, Exp(cA))-processes. 
Set s = e~ r /^ ^ and apply the branching property at the first branching time of a single 
GW(q\, Exp(cA))-tree to obtain for its population size X x at time t with w x (s) = ¥,(s Xt ) 



w. 



( S ) = se -CA* + / y2 qx (k)(E(s x t-y)) k c x e- c ^dy = se-^ + / ^(u^( 8 ))c A e- 
fc=0 Jo 



Now apply fjl4|> . multiply by e CA *, differentiate with respect to f and rearrange to get 



^-HA)(l-^ A (s))) 



^- 1 (A)(l-«) t A (s)) 



(23) 



For s = e _r ^ 'W, we have - s) -> r and by (J2ZD also ^ _1 (A)(1 - -> «t(r) and 

then 

E(exp{-rZ 4 A /^~ 1 (A)}) = exp{-/?V _1 (A)(l - w x (s))} -> as A oo, 

as required. 

For the proof of almost sure convergence, recall our notation 7ft (T) C U for the population 
alive at time t of a tree T, which we will slightly abuse and also apply to bushes. Let < fi < A, 
p = 1 — fx/X and Q\ >t = a {W s (By), X' > A, s < t}. Now note that Z x /ifj~ 1 (X) is £/A,t- m easurable, 
and that ip'(0) > — oo ensures that K(Z^) < oo. Then, a.s., 



E(^|g v )=E( £ i {7u 



(B p x - col )=0} 



«e7r t (_B A ) 



(b;- co1 )=o} 



since = ^-Kt(B^). By the branching property in Remark 116(c). applied to functions /«(•) 
l{ 70 (.)=o}> an d fv(-) = 1 for i> 7^ u, we obtain, a.s., 



L {74Br col )=o> 

«67ft(Bx) 

According to (fl6j) . this shows that 

E 



g x A = Z^Fh 9 (Bm = 0) = (1 - g x {p))Zl 



jp— col 



^(m) 



^-i(A) 



a.s., 



which is the martingale property in the (decreasing) filtration (Q\t, X > 0) that implies that 
Z x /tp~ 1 (X) — > Zt almost surely as A — > oo, see e.g. [32]. □ 



3 Growth of GW(q\, K\ , (3\) -bushes: lifetimes 

Theorem [2] considers Galton- Watson bushes B\ with general independent /^-distributed life- 
times. The main statement beyond the exponential case of Theorem [1] is that consistency of a 
full family (B\, A > 0) under Bernoulli leaf colouring requires k\ to be geometrically divisible. 
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3.1 Proof of Theorem d 

(i) =^ (ii): Suppose, (i) holds. In particular, is then the (l—/z/A)-reduced offspring distribution 
associated with q\, for all /i < A. By Theorem [H (f q (s) = s + — s), where tp has the form 
([2]). To be specific, let c = 1, /3 G (0, oo) and parametrise (q\,/3\) using ip as in Section [2.3,11 

Consider a GW(g,\> K^)-tree T\. By Remark I14[ Lemma [131 applies. Using notation similar 
to the reduction procedure of Section [2.1.31 for T = T\, p = I — 1/A, l = l\, A > 1, we can write 
C0 ~ Ki as 

c = e e^E^, (24) 



for 



> 1, and ~ Geo(a) with a = ^'(^(l))/^'^ -1 ^)) independent. 

Specifically, consider T n = {G^ = n}, where = '-^ 1 (0) in the notation of Section [2.3.21 On 
T n , write v o = and for 1 < j < n—1, denote by Vj the unique black child of Vj—i, then an ra-fold 
inductive application of the two-colours branching property of Section [2.1.31 also summing over 
all offspring numbers and colour combinations as in Lemma [T3l yields for p = 1 — 1/ A 



p— col 

?A® K A 



n-l 
j=0 



, r 



70 : 




fa>(*)"A(<kO¥^ A (5a(p))E^ a ^ a 



n-2 

n fcj+i(Cyj);r n -i 

i=o 



70 : 



(0,oo) 



kj(z)nx(dz) (1 



By Corollary 1171 a I ■0'(-0~ 1 (l)) /^'(oo) = l/ij)'{p6) as A -> oo. Therefore, we can write £0 ~ K4 
as in (|24p for all a > 1 ftp' (00), with the convention l/^^O) = if ^'(O) = 00. This yields (ii). 

(ii) => (i): Suppose, (ii) holds. By Theorem [H the family (q\,X > 0) exists as required. By 
Theorem [T] and Remark fT4~l we can express (p qx and a in terms of tp as in Section 12.3.11 and 
Lemma [131 choosing c = 1. 

(7) 

For A > 1, geometric divisibility ^>f k permits us to define k\ as the distribution of Xa 
for a = V^W^COWW' -1 ^)) = 1/V' / (V'~ 1 (^(1))) > W(oo). Now consider a GW(g A ,K A )- 
tree and p = 1 — 1/A. Use notation from (i) => (ii) and also set v* 



On T 



{v v 



j} for j > 2, denote by wi, ■ ■ ■ , Wj the black children of v* . Then, 



by Remark [TH and repeated application of the two-colours branching property, we obtain 



exp < 



G, 



(a) 



£ Cv m \llfi(e 



m=0 



, T • 



i=l 



78 



E 



'(0,00) 



<K A (dz)| {1-aT- 1 ^{g x {p)) 



{i-gxip))^ 1 



He 



0]. 



1=1 



This is the branching property characterizing GW(g, k), because the first term is the Laplace 
transform of a geometric sum with distribution k, up to a factor of a, and for the middle term 
we identify the offspring distribution q using all the cancellations due to (fT4"j) , (fTB]) and (fTT|) 



J=2 



_ f l yg(gx(p)) , 



1 + 



1-9a(p)) 



j-1 



+ 1 



J=2 



¥UflA(p) + «(1 ~ gA(p))) - 1 - (1 - g)^ A (gA(p))(l ~ 9x(p)) 

«(1 -9\(p)) 



s + 



^-1(1)^(^-1(1)) 
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confirming that (q,K) is the (1 — l/A)-reduced pair associated with (q\,K,\). For fi < 1, set 
a = ij)' (ip -1 (//)) I 'ip' (1)) and define to be the distribution of 

for independent ~ k, j > 1, independent of ~ Geo(a). 

As above, (q^,K^) is the (1 — unreduced pair associated with (q,n). The reduction relation 
for < /j, < A < oo follows using transitivity of colouring reduction (see Remark E]^a)) for 

< v < /i < A < oo. Specifically, for u = 1, this yields that (B^ U ^ X ^ ldc ,B\) = (B U ,B\). For 
= 1 < fi < A and v < fi < A = 1, this argument can be combined with the uniqueness of the 
divisor distribution (Lemma II ip . This completes the proof of (ii) =^ (i). 

We identified ((3\,\ > 0) in (i) (ii). The same reasoning as in Remark 1141 allows us to 
combine (i) here and Theorem [T] to see that (q,K,f3) is the (1 — l/A)-reduced triplet associated 
with (q\, k\, f3\). The existence of (B\,A > 0) now follows from Kolmogorov's consistency 
theorem. Uniqueness of the families (q\, A > 0), A > 0) and (/3a, A > 0) for each f3 = f3\ G 
(0, oo) follows from the uniqueness results in Theorem Q] and as shown in (ii) (i). □ 



3.2 Reconstruction procedures and backbone decomposition 

If K\(dz) = f\(z)dz is absolutely continuous for all A > and C = Ci + " ' + Cg(<*) ~ K n f° r 
independent Q ~ k\ and ~ Geo (a) for a as in Section 12.3.21 we find conditional joint 
distributions as in (fT9l) 

f\( z - E "=i U]=l fxivj) 



/Cl,..,Cn-l|G(«)=n,C=^^ 1 ' • • • ' ^-i) - ,*(n), A ^ 



for > with yi + • • • + ?/n-l < z, n > 1, where / A n " ; is the nth convolution power of f\. 



3.2.1 Reconstruction procedure for GW(q\, K\, /S^-bushes 

For B^ ~ GW^,^,^) the procedure in Section [2.3.31 with Exp(c,\) replaced by k\ and (fT9|) 
replaced by ([25]) constructs .E>a ~ GW(gA, ^a, /3a)- 

In the general case, one could use regular conditional distributions. Alternatively, we can 
adapt the procedure in Section 12.3.31 using the subordinator (random walk) representation of 
geometrically infinitely (finitely) divisible distributions as explained below. 



3.2.2 Reconstruction procedure with subordinators in the case where n is g.i.d. 

In the g.i.d. case, let M = Uce(o,oo){C} X ©([0, C],[0, oo)), where D([0, C], [0, oo)) is the set of 
functions /: [0, (] — > [0,oo) that are right-continuous with left limits, equipped with the Borel 
sigma-algebra generated by the metric topology induced by 

h), (C 2 , / 2 )) = ICi - C2I + dsMli- A Ci), / 2 (- A C2)), 

where dsk is a metric that generates Skorohod's topology on ©([0, 00), [0, 00)), see e.g. [35l 
Section IV. 1]. Consider a subordinator (a(t),t > 0), under P, such that a(V) ~ k,\ for an 
independent V ~ Exp(l). Define the following measure on No X H 

Q M ({j} X H) = q^j) ( P((z, (a(t), 0<t<z))€ H)c^ z dz. (26) 

J (0,00) 
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Now consider a bush of ~ Poi(/3 M ) random trees with distribution Pq m as defined in the 
branching property of Section 12.1.21 for the measure just defined. Then the reconstruction 

procedure of Section 12.3.31 can be applied subdividing subordinator lifetimes Q. ~ Exp(c^) 

rather than directly the population lifetimes (ffl ) ~ k^. Also define 

<r$ m (t) = + • • • + cgn-i +t) - a«(cS + • • • + d;Li), < f < di, 1 < m < ct i) . 

The remainder of the procedure is easily adapted. The resulting B\ is a bush of N\ ~ Po1(/3a) 
random trees with distribution Pq> a . 

3.2.3 Reconstruction procedure with random walks in the case where ip'(oo) < oo. 

In the case where k is geometrically divisible up to a (l) = ^'O^^WM > 0, let H = 
UnGN{ n i x [0) oo) n+1 be the space of random walk paths. On No x H, consider 

oo 

Q* W ({j} xH) = g M 0') < k <n)) e H)(l - a^))^ 1 a (fi) , 

n=l 

where, under P, (a(k),k > 0) is a random walk with a(k + 1) — o~(k) ~ Kqo i.i.d. In fact, in 
this case, the distribution k m exists since the Laplace transforms of k\ converge as A — > oo to a 
completely monotone function continuous at zero. Then the reconstruction procedure of Section 
12.3.31 can be applied subdividing geometric "random walk lifetimes" G$ ~ Geo(ao(/u)) into 
Gu = Gu\ + " " " + C n (c,i)j for independent G$ m ~ Geo(ao(A)), rn > 1, and G^* ~ Geo(a). 
Also define 

4U k ) = 4\gI\ + --- + + fc) - 4\g% + • • ■ + gJJLo, o < fc < g£L- 

The remainder of the procedure is easily adapted. The resulting B\ is a bush of A^ ~ Poi(/3^) 
random trees with distribution Pqrw. 

3.2.4 Backbone decomposition of supercritical Bellman-Harris processes 

The reconstruction procedures that build a GW(qa> ^a, (i\)-h\xs\\ B\ from a GW(g )1 , k m , /? M )-bush 
give rise to decompositions of the associated Bellman-Harris process = #Tf t {B\) along the 
GW(g M , k^, /? M )-bush 1?^. In the sequel, we will write BH((/a,ka) for such a Bellman-Harris 
process and when we specify its initial distribution, all these individuals are taken with zero age. 

In the supercritical case ip'(0) < 0, note that the GW(g[ 1 re ^ A ' rdc , KA)-trees with offspring 
distribution as in (|20p that are grafted onto B^ are subcritical for all < fx < A. The case fi = in 
is at the heart of many decompositions in various settings, mainly continuous analogues with and 
without spatial motion, see [H 111 } [Po l 116]. As an immediate consequence of our reconstruction 
procedures, we obtain a version of the backbone decomposition for Bellman-Harris processes. 

Corollary 19. Let ip be a supercritical branching mechanism, Bq a bush of Nq ~ Poi(/3 , i/ ;_1 (0)) 
random trees with distribution Pq as in (J26]). Subdivide each subordinator lifetime as in Section 
\3.2.2\ to get a bush B$. For each u £ Bq independently, given v u = m children, consider 
a BH(gj v ~^ c , k\) -process with of distribution ([2T|) . Also consider a BH(^~^ C , k\)- 
process Z TOOt with Z r oot ~ Poi(/3('0 _1 ( A ) - V*" 1 ^)))- Then the process 

z t = #w t (B ) + zr ot + E Z t-lu 

u£Bo : uj u <t 

is a BH(q\, k\) -process with Zq ~ Poi(/3^ _1 (A)). 
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3.3 Limiting trees and branching processes as A — > oo 

3.3.1 Convergence of trees: R-tree representations, Levy trees and snakes 

A random marked tree T x with distribution Pq a as in Section T3.2.2I specifies marks ( u ~ Exp(c A ) 
as well as cr u ((u) ~ K x- Therefore, we can associate coupled trees 

T\ = {(«, Cu) : (u, (u, a u ) G T A } ~ GW(q x , Exp(c A ), /3 A ) 
and T x = {(it, <J u ((u)) ■ {u,(u,cr u ) G T A } ~ GW(g A , k x , /3 a ), 

that only differ in their lifetimes. In the same way, we obtain coupled bushes and consistent 
families (B X ,X > 0) and (-£>*, A > 0). Several other representations of (T X ,T X ) are natural. For 
an R-tree representation 

r A ° = Mu UWx(«>:i 

«6t a 

of T x , with root p = (0, 0) and metric d given by d((v, s), (w, t)) = \t — s\ for v ■< uu or w ■< v , 

d((uiv, s), (ujw, t)) = s + t — 2w° for u, v, w G U and i, j G N, i / j, 
see [TT1 Sect. 3.3], we can consider the measure 

W\({u} x (a, 6]) = a u (b - a°) - a u (a - a°), a° u < a < b < u° a ,u £ r A , 

which for a u = a as in ([H])-©, u G r A , is of the form W A = dLeb + 7?. A , where 7^. A is an infinitely 
divisible independently scattered random measure on 7^° , in the sense of |28} [391 HTj , with F- 
measure Leb<8>A, where Leb denotes Lebesgue measure on T x °- For technical convenience, we can 
embed (T x °,d,p) as a metric subspace of ^i(N) = {x = (x n ) n >i: x n G [0, oo),n > 1; ||x||i < oo}, 
where ||x||i = Xll x nl ^ s the £i-norm, with root G ^i(N). We can embed bushes (B^,X > 0) 
consistently following [11, Remarks 4.9-4.10]: in the (sub)critical case, this family starts from 0, 
is piecewise constant and evolves by adding single branches, so we can represent the jth branch, 
of length L° say, as [[x(j),x(j) + L^ej}} for some x(j) = . . . , Xj-tij), 0, . . .) G B° Xj i and 

ej = (0, . . . , 0, 1, 0, . . .) the jth coordinate vector in ^i(N). Similarly, we can embed (£>*, A > 0) 
for certain L* = aj(L°). The supercritical case can be handled using [TTJ Proposition 3.7]. 

For the £i(N)-embedding, we can consider W A as a random measure on £i(N) (with support 
included in the embedded B° x ). Then B° x ' wt = (B° X ,W X ) is a weighted R-tree in the sense of [171 
[19], in the (sub)critical case, but weak or vague convergence as A — > oo are not appropriate since 
in the limit the measures become infinite on any ball around a point in some B° x . Nevertheless, 
consistency implies that in the case where convergence to a locally compact and separable 
Levy bush B° occurs in the Gromov-Hausdorff sense [111 Theorem 5.1], the random measures 
W A consistently build an infinitely divisible independently scattered random measure W on 
B° C ^i(N) whose Poissonian component still has F-measure Leb|#° ® A, while the continuous 
component is still dLeb|go and where Leb is one-dimensional Lebesgue measure on ^i(N). 

For x G B° consider the path [[0,x]] = {y G B° : y n < x n , n > 1} = {f x (t): < t < \\x\\i}, 
where ||/x(t)||i = t, < t < \\x\\i. Then S: B° -> M with M as in Section [3X2] given by 
S(x) = (||x||i, (W([[0, f x (t)]]),0 < t < ||x||i)) can be seen as a snake in the sense of [35] Section 
1.3.2], where the spatial motion here is a subordinator with characteristic pair (d,A) as in ([9]). 
See [25] for another setting where discontinuous snakes appear naturally. 

Proposition 20. In the setting of Theorem^ if B° is separable and locally compact, then B* , 
the closure of\JB x in ^i(N), is separable, but locally compact only if A = 0. Also, if B° is 
furthermore bounded, then B' is bounded if and only if A = 0. 
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Proof. Separability of B* is trivial as B* x is separable. Now argue conditionally given (£>^)^> . 
To show that boundedness and local compactness fail unless A = 0, first consider A = 5h, the 
Dirac measure in h > 0. It is not hard to show that the subtree of B° above every vertex, except 
leaves, has infinite total length. As a consequence, there will be Ai > for which Wx-^ has an 
atom x\, and the subtree {y G B° : x\ G [[0, y[[} above x\ has infinite length. Inductively, we 
find an increasing sequence x\ < x<i < • • • of atoms of W showing that B* is not bounded. 

Now assume that B* is locally compact. By the Hopf-Rinow theorem [21] . closed balls 
are then compact. Since B° is locally compact, [111 Remark 5.1] implies that [11, Theorem 
5.1] applies. In particular, ip'(cc) = oo, so that B° \ [[0, Lfei]] has infinitely many connected 
components, each of infinite total length, so that they all contain atoms of W. However, L\ = 
a\{L\) is bounded and for a large enough ball with B{r) = {x £ B' : \\x\\i < r} D [[0, LJei]], the 
set B{r + h) \ [[0, £*ei]] also has infinitely many connected components each exceeding diameter 
h. Considering a cover of B(r + h) by open balls of radii less than h/2, there is no finite subcover, 
as each connected component needs at least one ball that does not intersect any other connected 
component, which contradicts the compactness of B(r + h). So B* is not locally compact. 

For any A / and h > with A((/i, oo)) > 0, let A = A((h, oo))5 h , couple B* and B* and 
argue as above that B' is not bounded nor locally compact, then deduce the same for B*. □ 

Since B* is separable for a large class of branching mechanisms and lifetime subordinators, 
the framework of [IH] can be used to further study these trees. 



3.3.2 Superprocesses, backbones and convergence of Bellman-Harris processes 



Sagitov [40] studied convergence of Bellman-Harris processes to certain non-Markovian CSBP 
whose distribution is best described via Markovian superprocesses that record residual lifetimes, 
see also [27] • Specifically, a (£, K, ^)-superprocess [13] is a Markov process M = (M t ,t > 0) on 
the space of finite Borel measures M([0,oo)) with transition semigroup characterised by 



E 



exp 



'[Q,oo) 



f{z)M t {dz) 



Mi 



exp 




u t - z (f)m(dz) 



f{z-t)m{dz)\ , (27) 



(i,oo) 



for all / : [0, oo) — > [0, oo) bounded continuous, where ut (/) is the unique (at least if ^'(O) > 
nonnegative solution of 



Ml) 



I 

J {OA 



^(u t - s (f))dH s =E(f&)) 



-oo 



(28) 



and H s = E(K S ) is the renewal function of a strictly increasing subordinator or random walk 
a, here in terms of the local time process K s = inf{t > 0: a(t) > s}, which is an additive 
functional of our particular choice = cr(Kt) — t of Markovian spatial motion. Since branching 
only occurs at the origin, M is a catalytic superprocess 0Q3]. We call the associated population 
size process Z t = M t ([0, oo)) a CSBP(ET, tp). Such processes appear as limits of Bellman-Harris 
processes, also in our setting. Unless stated otherwise, we understand Z = (3 as M = /3So. 

Proposition 21. Let Z^ = #Tf t (B\) for a consistent family (B\)\> of GW (q\, k\, f3\) -bushes 
with branching mechanism ip as in Theorem^ Suppose that ip is subcritical or critical, i.e. 
ip'(0) > 0, and that ijj'(oo) = oo. Let a be a subordinator as in Section \3.2.2[ Then, for allt>0 



1 



Z f 



almost surely, as A 



oo, 



where (Z t ,t > 0) is a CSBP (if, tp) starting from Z = (3. 
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Proof. The almost sure convergence follows from the same martingale argument as in Lemma 
[181 The identification of the limiting process follows from |401 Theorem 1], see [27] for a proof. 
Specifically, we consider the limit along the subsequence (A n ,n > 1) for which -0 _1 (A n ) = n. □ 

Proposition 2] states that the conclusion of Proposition [21] holds in the supercritical as well 
as tp'(oo) < oo cases. Sagitov [40] announces his convergence result to include the (finite- 
mean) supercritical case, but the proof in Kaj and Sagitov [27J only treats the subcritical and 
critical cases. They say that the supercritical case would require some further assumptions and 
additional work. In our less general setting, this is not difficult - our proof does not rely on [27J . 

Proof of Proposition [4] In the g.i.d. case ip'(oo) = oo, we use Lemma PTOl to represent k\ in 
terms of a strictly increasing subordinator a as k\ = P(a(V\) 6 •) for V\ ~ Exp(^'(^ _1 (A))). 
We introduce Markovian measure-valued branching processes M x that do not record residual 
lifetimes of Z x , but what we may think of as limiting residual lifetimes (in the limit A — > oo), 

M t = Yl 6 ^(K?_ a . )-(*-<) = Yl 6 *(x)-t, where a(x-) = W A ([[0,x[[), 

ueWt(Bl) iEgB°: a{x-)<t<a(x) 

using notation as in Section [3.3.11 and also = inf{i > 0: cr u (t) > s}. By (27] Lemma 3] or 
[13] Formula (1.5)], for the Markov process & = a(K t ) — t, the semigroup of M x is such that 




M x = m | = exp J - / vl z (f)m(dz) + [ f(z-t)m(dz) \ , 

/ J\0,t] J (too) 



f{z)M t x (dz 

[Q,oo) J J [ J[0,t] J(t,oo) 

[0, oo) bounded continuous, where v x (f) satisfies 
-vHf) = E (l - e -/M**)-*)) - J ^ ^ A ( e -«t s (/)) _ „-<.(/)) rfE^), 



e 



and where N x = #{k > 1 : i? A < s} is the renewal process associated with a random walk with 
Ru — R^i ~ K\. It is easily checked that K(N X ) = H s . The remainder is straightforward (cf. 
P2 Section 1.2]). We apply ^ to see that u x (f) = V -1 (A)(1 - exp{-^(//V> -1 ( A ))}) satisfies 

«*(/) + / ?P(ul s (f))dH s =E(i;- 1 (X)(l-expt — ' ) 



'[o,t] V V L -0 1 (A) 

Uniqueness in (j2SJ) means that u A (/) = u t (f\), where /a = ^~ 1 (A)(1 - e~^ _1 W) f /• By the 
Monotone Convergence Theorem, this implies for iV A ~ Poi(/3^> _1 (A)) and Mq = N\5q that 

eLpLJ /(^^Wjj = exp {_ / 3 V ,-i ( A)(l-^(//^- 1 (A)))}=exp{-/3 Ui (/ A )} 



E exp - / A («)M t (dz) 

'[0,oo) J 

E(exp<J-^ /(*)M t (<fe)J 




M = /35 = exp{-/3^(/)}. 



In particular, for Z t A = M t A ([0,oo), we obtain Z^/^- 1 {X) -> Z t , where Z is a CSBP(if,^). The 
martingale argument of Lemma [18] establishes almost sure convergence in the case ^'(0) > — oo. 
In the finitely geometrically divisible case, we use bushes B\ based on measures Q^ w and 

M t x = V S a j K u e j_ (t _ a .), where K% = inf{£; > 1: a u (k) > s}. 
Then the argument above is easily adapted. □ 



22 



As an application, let us derive a backbone decomposition. This should be useful to deduce 
more general supercritical Bellman-Harris convergence results from subcritical results. Our 
present paper is not about superprocesses nor convergence of general triangular arrays, so we 
do not push for highest generality nor assumptions as in [27], but we would like to mention a 
now natural approach - in a sense to be made precise, convergence of supercritical processes is 
equivalent to convergence of backbones and convergence of associated subcritical processes. 

We write P^ ^ for the distribution of a CSBP(X, ip) starting from r > 0. Just as for CSBP^) 
in Section 12.2.2} we consider the sigma-finite measure &K,ip such that P ^ ^ is the distribution 
of a sum over a Poisson point process with intensity measure rQx,ip- 

Theorem 22 (Backbone decomposition for CSBP(if, i/j)). Let tp be a (non- explosive) supercrit- 
ical, ipo(r) = ip(r + ■i/> _1 (0)) the associated subcritical branching mechanism. Let Bq be a bush 
of Nq ~ Poi(/3?/'~ 1 (0)) trees with distribution Pq as in (f26|) . and, as in Section [3. 3.1\ (Bq, Wo) 
a representation as a weighted M-tree, o~(x) = Wo([[0,x]]). Given (£>q,Wo), consider 

• points (Z x ,x £p) of a Poisson point process in D([0,oo), [0, oo)) with intensity measure 

Q(df,dx) = [200*,*, (40 + / ^(djye-^Wllidr) ) Leb\ B ° Q (dx), 

• extra points (Z x ,x G Br(£>g)) independent of (Z x ,x G V) with distribution 

m (°)l • y (°' o °) m (°)l 

where l(x) + 1 is the number of connected components of Bq \ {x} and Br(i3g) is the set of 
branchpoints {x £ Bq: l(x) > 2} \ {0} ; 

• and an extra point Z° independent of (Z x ,x G V U Br(Bg)) with distribution P^-^ - 
Then the process Z t = ^t-a(x) ^ s a CSBP(K, ip) starting from Zq = j3. 

a;G'PUBr(Sg)U{0} 

Proof. Since Z is not a Markov process, we will deduce the theorem from the richer structure 
of a Markovian (£, K, -0)-superprocesses M starting from Mq = m. Slightly abusing notation, 
we consider M also under ®k,4> an d P/^ and Q. Then the intensity measures and distributions 
in the bullet points specify a point process (M x ,x G V U Br(£>g) U {0}). From the exponential 
formula for Poisson point processes and from (I27D for the subcritical branching mechanism tpQ 
with uf lh (f) associated via the analogue of (|28p . it is not hard to calculate 



E exp < 



B° , Wo) = exp \- /j^ (n- b CT( ,)(/))-V'o(0))Leb(dx)| , 

4°K^(,)(/)) 



w= n .<«>, 0) 



E(exp{-/ /(z)M t °(^)l B5,Wo) = exp|-/ uft b z (f)m(dz)- [ f(z-t)m(dz)\, 



[0,t] J(t,oo) 



using the convention u s s uh (f) = for s < 0. It now suffices to show that the backbone decompo- 
sition of approximations M*~ of M, cf. Corollary 1191 appropriately converges to these quantities. 
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Let us formulate that backbone decomposition in the current setting. Let be the distribu- 
tion of M x given Mq = N^5 with N{ ~ Poi(V> _1 (^) r )- Then, given (Bq,W ), consider points 
(M x ' X ,x G V x ) of a Poisson point process with intensity measure 



2a(^" 1 (A) - V'- 1 (0))P(M A G -|M A = 5 ) + / re-^'^n^r), 



and 



lrf (x)) (o)l 



(0,oo) ° |^(0)| 



(0,oo) 



(/ir)|i| n(dr) forxGBr(^)andM°' A ~P^ o . 



Then the analogous calculations yield for Uf' suh (f) = i/j 1 (A)(1 — e Vt ' 

r x,A 

, ..A 11 : ) I 

£ o ,W | =exp 




[0,oo) ^ ( A ) 



[o,oo) /U ^- X (A) 



(%(<r CT( , ) (/))-^o(0))Leb(dx)j 

(/)/ A,sub 



S ° 5 Wo 



n 

:reBr(Bg) 



C (<%(/)) 



4°(o) 



E exp 



[0,oo) V ( A ) 



[o,t] 



(t,oo) 



/(*-i)m(<teH , 



But from the proof of Proposition U we know that u A ' sub (/) — )• uf lh (f) as A — )• oo, and this 
completes the proof. □ 

We can specialise this backbone decomposition to the case Kt = t, when a CSBP(i<C, -0) is 
simply a Markovian CSBP(?/j) and cr(x) = d(0, x) is just the height of x G Bq. In this framework, 
and even with a spatial motion added, this decomposition was obtained recently by Berestycki 
et al. [1], generalising an analogous result of Theorem 5.6] formulated in a context of Levy 
trees. 



4 Growth of GWl(q\ : K\, r]\, xa) -forests: immigration 

Theorem[3]is about forests F\ of GW(q\, Exp(cA))-trees arising from immigration of independent 
^-distributed numbers of immigrants at Exp(/i / x)-spaced times. The main statement beyond the 
no-immigration case of Theorem[T]is that consistency of a family (F\, A > 0) under Bernoulli leaf 
colouring relates r]\ to a continuous-state immigration mechanism 4>. After some more general 
remarks, we focus on the Markovian case of exponential lifetimes and inter-immigration times. 



4.1 A two-colours regenerative property and associated forest reduction 

Let F = (B(t),t > 0) be a GWI(g, k, r], x)-forest as defined in Section l2.1.4l specifically denote by 

n(i) T (i) > 

■(!)'••• ' J (7ViV 



Si = inf{i > 0: B(t) ^ d} ~ x the first immigration time, by B(S\) = (t£9, . . . , T/A J the bush 



of independent genealogical trees tQ) ~ GW(q, k), j > 1, of the N\ ~ 77 time-Si immigrants and 
by F post = (BP° st (t),t > 0) the post-Si forest given by £P° st (0) = d and B^ ost {t) = B(Si+t) 
for t > 0. It is immediate from the definition that F satisfies a regenerative property at S\ in 

that (/Si, B{S\)) is independent of -Fpost and F post — F, and that the distribution of (S\, B{S\)) 
as above together with this regenerative property characterises the distribution of F. Since 
colouring and reduction apply tree by tree, we obtain for the associated forest F p ~ co1 of coloured 
trees T^ p ~ co1 ~ F£f, 1 < j < N t , i > 1, a 
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Regenerative property of coloured GWI(g, k, rj, x)-forests 

(a) For all n > 1, Sj G {0, 1}, and measurable functions k, fj and G, 1 < j < n, we have 



E M^i)G(^- t co1 ) II MT^N, = n; U(T^ co1 ), . . . , ^(T^ 01 )) = (e 1 ,...,e n ) 



3=1 

n 

/ k(z) X (dzHn)g( P r(l - g(p)rE(G(F^))l[^ q -f[f^ = ej \, 

J(0,oo) 



'(0,oo) j=1 

where n r = E\ + • • • + e n and rib = n — n r are the numbers of red and black colours. 

(b) For t > 0, consider the post-t forest F^_ t = (BP- col (t + s), s > 0) and the pre-i sigma- 
algebra Ft = cr{-B p ~ col (r), r < t}. Then for all measurable functions f u , u G U, and G, 



u€Wt(F) 



F t \ = E(G(FP- co1 )) [] E£~f [/ U (0 0)S )|C > s] 
ue7f t (F) 



s=t—a v 



As a trivial application of (a), we can calculate the probability that all immigrants are red 

P(( 7 0(Tf 1 7 col ) ) ... ! 70(Tf J 7 1 7 1 )) =(1,...,1)) =Y J V{n)9(vY = vMp))- 

n>l 

We deduce the distribution of the number of red immigrants given that all immigrants are red 

p-coi, v / tiff* ' [im >^ y r y ^ ^(sg(p)) , 9Q , 

?jL (m) = < VWWJ with generating function 09 p-coi(s) = — ' . . (29) 

[0, if 771 = 0, "red t Pv(9(p)) 

Also by the regenerative property (a), the number G of immigrations until we see the first black 
immigrant is geometrically distributed with parameter 1 — (p v (g(p)), i-e. 

F(G = j) = ^(jp))^ 1 ^ - <p v (g(p))), j > 1. (30) 
Conditioning on having at least one black immigrant (probability 1 — ifn^gij)))), we get for £ > 1 



^- rdc w = r ^(9(p))Ef m+ 

1 - ^ V m J 



V (m + £)(g(p)r(l-g(p)) 



e _(l-g(p)M%(p)) 



Similarly, conditioning on having £ black immigrants, £ > 1, we get for the number of red ones 
%(m)=7/(m + l) l , ; " (ff(p)) w ^ > form>0. (31) 

These distributions have generating functions that we can express in terms of ip v , for s S [0, 1] 

/ v fyigip) + «(1 ~ g(p))) ~ ^(g(p)) , , v Vy Hsgjp)) 

ip vP -rac{s) = — — and (p^{s) = ' , £ > 1, 

and as </> P -rdc and ^ are analytic, we can extend c/?»? analytically to [— g(p)/(l — g(p)),l]. 
Evaluating at s = v p = -g(p)/(l - g(p)), we get <p v {g(p)) = -¥y-«ic(v p )/(l - w-rdc('Up)), so 

1 — V^p-rdc(Up) 

As reduction preserves the Galton- Watson property for trees [T1] , we now see that a p-reduced 
GWI(g, k, r], x)-forest is a GWI(g p ~ rdc , K p " rdc , rf- rdc , x p ~ rdc )-forest. Specifically, rj p - rdc is as 
above, %P~ rdc the distribution of a geom(l — tp v (g{p))) sum of independent x-distributed vari- 
ables. 
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4.2 Growth of GWI(g A , Exp(c A ), rjx, Exp(/i A ))-forests 
4.2.1 Proof of Theorem [3] 

(i) (ii): Suppose, (i) holds. In particular, is then the (1— ^/A)-reduced offspring distribution 
associated with q\, for all < fi < A < oo. By Theorem [JJ (p q (s) = s + ip(l — s), where tp has 
the form ([2]). To be specific, let c = 1 and parametrise (q\, c\) using ijj as in Section [2.3.11 

Now consider the relationship between rj = rju and r/ A for A > jj, = 1. By the discussion above 
(|3"2"j) . we can extend y> n analytically to [— g\(l — 1/A)/(1 — g\(l — 1/A)), 1], where, expressing as 
in (fT6j) . we have g\(l — 1/A) = 1 — ip~ x (l)/^ -1 (A) — ^ 1 as A — > oo. Differentiating (f32j) . we see 
that ^ has positive derivatives on (— oo, 1). Setting 0(r) = 1 — ip v (l — r), r > 0, the derivative 

(f)' is completely monotone on (0,oo) and, by Bernstein's theorem (see e.g. [E]), there exists a 
Radon measure A* on [0, oo) such that 



0'( r ) = / e~ rx A*{dx) < oo, r > 0. 

J[0,oo) 

From 0(0) = 0, we get integrability » x~ 1 A*(dx) < oo and 

00) = 0(0) + / 0'(r)dr = A*({0})« + / A*(dx), 

and, in particular, setting d = A*({0}) and A(cfe) = x _1 A*|( oo )(dx) yields (ii). 

(ii) (i): Now suppose that (ii) holds. According to Theorem [TJ the family of offspring 
distribution (q\,X > 0) exists as required; furthermore, we can express (p„ x in terms of ifi as in 
Section [2.3.11 choosing c = 1. 

By (HU), wehave 5A (l-/i/A) = l-^" 1 ^)/^" 1 ^), so ^, A = - 9A (l-/i/A)/(l- 5A (l-/i/A)) = 
1 — ?/ ;-1 (A)/'i/' -1 (A*) for all < /i < A < oo. By ([32]) and the discussion above (f32]) . the required 
immigration distributions must be of the following form, respectively for \i < 1 and A > 1 

_ y„foi(l-M ) + «(1 - 5i(l - A*))) - - A*)) _ ! " s)i>- l [n)/i>- l {l)) 



_ ^K> + r(l-^))-^M = 1 ^(l-r^-HA)/^-^!) 

i-^Ka) 0(v>~ 1 (A)/v>- 1 (i)) 



Since 0' is completely monotone, simple differentiation yields that these functions are indeed 
generating functions of immigration distributions. Furthermore, rjn is the (1 — /i/A)-reduced 
immigration distribution of r/ A for all < fi < A < oo, by the transitivity of colouring reduction 
noted in Remark [7(a), which also applies to forests, since colouring and reduction are defined 
tree by tree. The full statement of (i) can now be obtained formally as in the proof of Theorem 
[2j with the simpler regenerative property here taking the role of the branching property there. 

In the setting of (i) and (ii) for c = c\ G (0, oo), h = hi £ (0, oo), Kolmogorov's consistency 
theorem allows us to set up a consistent family (F\,\ > 0) of GWI(g A , Exp(c A ), rjx, Exp(/i A ))- 
forests. Uniqueness of (qx,cx), A > 0, follows from Theorem [TJ Uniqueness of (r/ A ,A > 0) 
was noted in (ii) =>• (i). Uniqueness of (/i A ,A > 0) follows from the relationship between inter- 
immigration times as geometric sums, where we calculate hx from (|30p for A > 1 > [i as 

h h 
h\ = (1 — co n . (<7 A (1 — l/X)))h = — and h u = — . 

□ 
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4.2.2 Freedom in parameterisation and standard choice 

In analogy to Section 12.3.11 we can use a single function <j> to replace ((f), h) of Theorem [3] and 
parametrise (rj\, Exp(/i^)), A > 0, such that 

^^'-^^FwT 2 a,ld (33> 

where <f> is a linear transformation 0(s) = k^ik^s) of 0. Specifically, we choose k% = h and 
&4 = It is easy to check that this works, using (f)(1) = 1 — 77(0) = 1. 

In this parameterisation, we can also express in terms of ip and (f> the remaining quantities 
studied in Section 14.11 E.g. ([33]) and (|29|) now yield the generating function of the pure-red 
immigration distribution 

M^ /A) - rol(s) = ^(A)-^- 1 ^) ' ( } 

The parameters of the geometric distributions (|30[) take a simple form that leads to a distinction 
of finite/infinite immigration rate ^j^^ 11 — 1 — ^P^xi^Qxi^ — M/^)) — ( /K'0 

• a JX > as A ^ oo, if <£(oo) < oo; 

• ajf 1 ™ — ► as A — ► oo, if 4>(oo) = oo. 

With the formulas above we can formulate explicitly a reconstruction procedure. 

4.2.3 Reconstruction procedure for GWl(q\, Exp(ca), r]\, Exp(/iA))-forests 

For ~ GWl(q fl , Exp(c At ), rj^, Exp(/i^)), we modify the steps of Section [2.3.31 to construct F\. 

1. In every tree of F^, subdivide lifetimes as in Section T2.3.3I and hence construct a forest F^. 

2. In every tree of F M , add further children and independent red trees as in Section [2.3.31 and 
hence construct a forest F\. 

3. At every immigration time, given that there are = £ immigrants in F„, consider a ran- 
dom number ivl? ed ~ rji of further immigrants as in (I31j) . proceed as in Section \2 . 3 . 31 and 

superpose a further independent GWI(q^ T J^ c °\ Exp(c,\), V\^ T ed^ C °\Exp(/ix — h^))- 
forest with distributions as in (|20j) and (134j) to finally obtain F\. 



4.2.4 Convergence of the population sizes: proof of Proposition [5] 

For convergence in distribution, we calculate the Laplace transform of Y t x j 'if;' 1 (A) , where 

Jt Jt N * x 

Yf = Tf t (F x ) = ^T t _ st (Ba(S?))=EE^ ( T £) A ) 

i=l i=l j=l 

with notation as in and around (|7|), but with all quantities A-dependent. We exploit that 
E\S?) = Z^ x = (w t (B x (S?)),t > o) , E x (s) = 0, a {s*,i > l} , 
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is a Poisson point process with intensity measure h\ times the distribution of a GW (q\,Fixp(c\))- 
process Z^ ,x starting from Zq^' A ~ tj\. By the exponential formula for Poisson point processes, 

-h x / Yl ^( m ) i 1 - ( E { sZLv )T) dv r ' 

m=l \ J 

where s = e~ r ^ 1 ^ and Z A is the population size of a single GW(q\, Exp(cA))-tree as in the 
proof of Lemma [TBI Using notation and asymptotics from there, as well as (|33]) . this equals 

h x f (l - tp nx (wl v (s))) dv\ = exp [- C <t>(y-\\) (l - «£_„(*))) d« 



exp 

Since ■0 _1 (A) (l — w^_ v (s)) — > u t - v (r), and, by ([22]) and ([23]) . all these quantities are bounded 
by max{r, ■0 _1 (O)}, dominated convergence completes the proof of convergence in distribution. 

Almost sure convergence follows by martingale arguments as in Lemma [TBI using a version 
of the regenerative property (ii) of Section T4. II rather than the version of the branching property 
^) that we presented in Remark 1 16lfc). □ 

From Proposition [5] and Lemma [18] we deduce the analogous convergence result for GWI- 
processes starting from initial population sizes y A ~ Poi(/3 A ). The limiting CBI then has Y = (3. 

4.3 Analogous results for G\Nl(qx, Kx, rjx, XA)-forests 

Finally, let us combine Theorems [5] and into a single statement and also deduce the analogous 
pattern for general inter-immigration distributions that now emerges naturally. 

Corollary 23. For a tuple (q, n, rj, x) of offspring, lifetime, immigration and inter-immigration 
distributions, the following are equivalent: 

(i) There are (q x , k x , n x , Xx)x>o with (q x , m, rji, Xl) = (q,K,V,X) such that (q^, k^, r]^, is 
the (1 — /i/ X) -reduced tuple associated with (qx, kx, r]x, X\), f or all < fj, < X < 00. 

(ii) The generating functions ip q of q and ip v of r\ satisfy ip q {s) = s + Tp(l — s) for some ip of the 
form ([2]) and (p v (s) = 1 — 0(1 — s) for some <p of the form ([3]); k is geometrically divisible 
for all a > l/ip'(oo) if ip'(oo) < 00, or for all a > if ip'(oo) = 00. Moreover, x is also 
geometrically divisible 

• for all a > l/0(oo) if <f>(oo) < 00; 

• for all a > if (f>(po) = 00. 

In the setting of (i) and (ii), a consistent family (F\)x>o °f GWl(qx, K\,r]x,Xx) -forests can be 
constructed such that (F^, Fx) — (F^ M ^ rdc , F\) for all < fi < X < 00. 

We omit the proof which is a straightforward combination of the proofs of Theorems [2] and [3] 
Similarly, the convergence results of Proposition [J] and [5] find their analogue in this setting: 

Corollary 24. Let (Y t x ,t > 0) be the population size process in the setting of Corollary {E 
Then 



Yt in distribution as X —> 00, for all t > 0, 



where (Y t ,t > 0) is a CBl(K,tp,K,<p) with Y = 0, for 

• branching mechanism ip a linear transformations of tp as in (T5\ 
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• K = (K s , s > 0) with infjs > 0: K s > V\} ~ K\ for V\ ~ Exp(c,\) with c\ of Theorem^ 

• immigration mechanism <f> a linear transformation of <f> as in A33\) 

• K = (K s ,s > 0) with inf{s > 0: K s > V x } ~ Xx for V\ ~ Exp(/i A ) with h\ of Theorem^ 

If furthermore ip'(0) > — oo and <f>'(0) < oo, then the convergence holds in the almost sure sense. 

Like a CBI(^>,<^>), a CT5I(K , i[) , K , can be constructed from a Poisson point process (E s ,s > 
0) in B([0, oo), [0, oo)) with intensity measure d&K,tp + fr ^ ^A(ds), where (d, A) are the 
characteristics of in Also consider an independent subordinator a or increasing random 
walk, in the (f>{po) < oo case, in fact K s = inf{i > 0: a(t) > s} clarifies the notation and the 
meaning of K. Then 

is a CBI(K,V,i^,0) with F = 0. Like a CSBP(iT,V), a CBI(K, ip, K, 0) is non-Markovian, 
but admits a Markovian representation (M, i?) that records residual lifetimes as well as residual 
times to the next immigration, with values in M([0, oo)) x [0, oo), where #t = d{Kt) — t, so that 



E exp - f(z)M t (dz) - r# t \ 

'[0,oo) I 



Mq = m, i?q = s 



E( exp<( - / ut- z {f)m{dz) - I f(z - t)m(dz) 

'[Q,t] J(t,oo) 

<P(u t _ s _ z (f))dK z - r (d(K (t _ s)+ ) - (t - s)) 

[0,t-s] V * 

where ut(f) is the unique nonnegative solution of (j28|h Then the process Y t = M% ([0, oo)) is a 
CBI(i^, ip, K , <j)). We leave any further details including the proof of Corollary 1241 to the reader. 
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